CONSENSUS WITH TERNARY MESSAGES 
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Abstract. Wc provide a protocol for real- valued average consensus by networks of agents which 
exchange only a single message from the ternary alphabet {—1, 0, 1} between neighbors at each step. 
Our protocol works on time-varying undirected graphs subject to a connectivity condition, has a 
worst-case convergence time which is polynomial in the number of agents and the initial values, and 
requires no knowledge about either the graph topologies or the number of agents for implementation. 
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1. Introduction. The average consensus problem asks for a protocol by means 
of which n agents with initial values xi (0) , . . . , x„ (0) can compute the average ^ Y^^^i Xi (0) 
subject to time- varying restrictions on inter-agent communication. Recent years have 
seen a surge of interest in consensus protocols due to their widespread use as build- 
ing blocks for distributed control laws in multi-agent systems; for example, consen- 
sus protocols have been used for formation maintenance [Ml [TOl [7] , coverage control 
[l8l fT9l [3] , network clock synchronization [171 El [13 , distributed task assignment and 
partitioning [8] , statistical inference in sensor networks [H |25l [9] , and many other con- 
texts in which centralized control is absent and agent motion and varying interference 
can lead to repeated failures of communication or sensing. 

Consensus protocols typically involve each node i maintaining a variable Xi{t) 
which is updated from time i — 1 to time t by setting Xi (t) to be a convex combination 
of those Xj{t — 1) for which j and i are neighbors in some undirected graph G(t) = 
({1, . . . , n}, E{t)). The graph sequence G{t) is meant to capture the constraints that 
dictate which pairs of agents can communicate or sense each other at each time. It is 
assumed that each G{t) has a self- loop at every node, so that Xi{t) always depends on 
its previous value Xi{t — 1). A typical update is the Metropolis iteration (introduced 
within the context of consensus in |25j). defined as 

x,{t) = x,{t~i)+ y '''^'^'}]7''j\\,^^ (1.1) 

^' ^ ' max(d,(t),d,(t)) ^ ^ 

where Niit) is the set of ncigbors of node i in G{t) and di{t) is the degree of node i in 
G{t). It is assumed that the graph sequence G{t) satisfies the long-term connectivity 
condition 

For each fc, the graph ({1, . . . , n}, \j'^f.E{t)) is connected, (1-2) 

meaning that the communication restrictions faced by the nodes do not disconnect 
the network into noncommunicating groups after some finite time k. Under this 
assumption, the Metropolis iteration of Eq. (|l.ip has the property that 
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meaning that all agents suceed in converging to the average. We refer the reader to 
the papers [H [T^l HOI HH [Ml [IS for proofs of this and similar assertions. 

Consensus protocols have proven to be useful for multi-agent control due to their 
attractive robustness properties (namely, that the communication sequence G{t) can 
vary arbitrarily subject to the relatively weak long-term connectivity constraint of 
Eq. (11.21) ) and their local, distributed nature (i.e., every agent updates based only 
on neighboring agents). A number of recent advances in multi-agent control have 
proceeded by reduction to an appropriately defined consensus problem [3l I10[ [71 117| 
[53]. Moreover, we note that under slightly stronger assumptions on the sequence G{t), 
it is possible to design consensus algorithms with convergence time bounds which are 
polynomial in the number of agents n |21) . 

However, a limitation of consensus protocols lies in the assumption that agent i 
can update Xi{t) as a function of the states Xj{t — 1) of its neighbors in the graph 
G(t). While this is very natural in some settings (such as formation or coverage 
control, where Xj(t) represents the position of agent j, which neighboring agent i can 
be reasonably assumed to sense) in other contexts (such as clock synchronization or 
statistical inference in sensor networks) this implies that the agents can exchange real 
numbers at every step, a clearly unrealistic assumption. 

The focus of this paper is to rectify this by proposing a consensus protocol in 
which the nodes exchange a finite number of bits with their neighbors at each time t. 
Specifically, we provide a deterministic protocol for the nodes to exchange messages 
with neighbors and update Xi(t) which has the following properties: 

1. For every time- varying sequence G{t) subject to a certain connectivity as- 
sumption, each Xi (t) converges to the average of the initial values i X]r=i ^» ■ 

2. The number of bits that is transmitted from agent to agent at each time t 
is bounded above independently of t, n and all other problem parameters (in 
fact in our protocol the agents transmit a single element from the ternary 
alphabet {—1,0, 1} to each neighbor at every t). 

3. The protocol does not require knowledge of either the graphs G{t) or even of 
the number of nodes n. 

4. The protocol has a worst-case convergence time which is provably polynomial 
in the number of nodes n and the initial values Xi (0) . 

A number of consensus protocols in which agents exchange finitely many bits at 
every step have recently been proposed |4l[6l[TTl[T3l[2Tl[T5l[l6]. However, each of these 
protocols lacks at least two of the features (l)-(4) above. We remark that features (1) 
and (3) are particularly crucial since much of the attractiveness of consensus protocols 
arises from their abilities to cope with link failures and their local, distributed nature. 
Feature (4) is clearly useful: the appeal of any protocol is increased by the availability 
of polynomial worst-case convergence bounds. Finally, feature (2) ensures that we do 
not implicitly assume that an arbitrarily large amount of data can transmitted on 
each link before the graph changes. 

We begin by stating our main result in Section [21 where we provide an informal 
description as well as a formal statement of our protocol. The proof of the main 
convergence theorem is given in the subsequent Section [3] Some simulations of our 
our protocol are provided in Section |4l Finally, Section [5] provides some conclusions 
and lists some open problems. We note in particular that our new consensus protocol 
makes more stringent than usual assumptions on the graph sequence G{t) and has 
more storage than the ordinary consensus algorithms, and we pose the improvement 
of these features as open problems. 
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2. Our results. We first reprise the notation we have introduced: G{t) is a 
sequence of undirected graphs with a self-loop at every node, Ni{t) is the set of 
neighbors of node i in G{t), and di{t) is the degree of node i in G{t). 

2.1. Intuitive description of the protocol. We begin by informally describ- 
ing the idea behind our protocol. We would like to run the Metropolis update of 
Eq. (|l.ip . but without the ability to transmit real numbers, node i will not know the 
values of its neighbors Xj{t~ 1) exactly. Consequently, node i will maintain estimates 
of Xj{t); we will use the notation Xi_j^in{t) for the estimate that node i has for Xj{t). 

A crucial point is that node i will be aware of the value of any neighbor j's 
estimate of Xi{t) because this estimate will be built from the messages node i has 
previously sent j. Thus node i will maintain the variable Xij^ontit) which will equal 
node j's estimate of Xi{t), i.e., will equal Xj^iSnit). 

At each time t, node i will receive a message from j from the alphabet {—1, 0, 1}. 
If it receives a -1-1, it will add to Xij^in, if it receives a —1, it adds a — to the 
same; and if it receives a zero, it leaves Xij^-m unchanged. Naturally, node j decides 
to send a 1, 0, —1 depending on whether a?ij,in is too low by at least l/t", within 
of the true value, or too high by a factor of at least l/t", respectively. Note that even 
though it is node i which maintains the variable Xij-_in, node j knows what it is by 
the previous paragraph. 

The number a will be in (0,1) so that 



which will be key in ensuring that the estimates Xi.j^in increasingly become accurate 
for links that regularly appear. 

Meanwhile, the nodes will implement the Metropolis update, with each node i 
replacing the neighboring values Xj{t — 1) by the estimates Xij_in- However, node i 
will not include all its neighbors in the Metropolis updates; rather, it will include 
only those which sent it a zero at time t (which means that 5;^ j-.in will not be too far 
away from Xj{t)) and whose iJij-.in are sufficiently far from Xi{t) so that the inevitable 
pertubration error inherent in using imprecise estimates does not affect the available 
bounds the available Lyapunov bounds for the Metropolis algorithm. 

Moreover, we introduce a stepsize of l/t^ into the Metropolis algorithm where 
(3 > a and (3 £ (0, 1). Indeed, by choosing /3 > a, we ensure that agents change their 
values Xj(t) slower than estimates Xi.j^n change. Intuitively, while the estimates Xij^in 
will get accurate by an additive factor of l/t" whenever they are very inaccurate, the 
values Xi (t) will have their movement attenuated by a factor of 1 /t^ . The introduction 
of the stepsize ensures that not only do the estimates "catch up" to the true 
values, but we can give a simple bound on how long it takes until estimates become 
accurate. 

2.2. Formal description of the protocol. The nodes begin with initial values 
Xi{0). Every node i will maintain variables i?ij-,in(i), 2?i.j,out(0 for every node which 
has been its neighbor in some past G{t). Node i initializes 




t=i 



XiJ^ini^ 1) — '^ij,out(^ 1) — 



at the first time t when the edge {i,j) belongs to E{t). 
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At each iteration < = 1, 2, 3, . . ., node i will send a value from the set {—1, 0, 1} 
to each of its neighbors j in G{t). The value i sends to j is 

q^^j(t) = R - 1) - x^,j,out{t - 1))] 

where 

■ 1 if a; > 1 
R\x] = iO if - 1 < x < 1 
-1 ifa;<-l 

Note that node i may send different messages to different neighbors. After sending 
Qi^j (t) to each neighbor j e Ni{t) and receiving qj^i(t) from the same, node i updates 
the values Xij,in(i), 2?ij,out(0 as 



and then updates its value Xi{t) as 

x.At)-x.At 1)+^, ^X.^^^^ AmMdm,d,it)) 

Here S{i,t) is the set of neighbors j of node i in G(t) which satisfy 

4 

and 



(2.1) 



2.3. Main result. We now provide the main convergence theorem for our pro- 
tocol. Wc begin with a few definitions needed to specify the class of graph sequences 
on which our protocol is guaranteed to work. 



Definition 2.1. We will call the graph sequence G(t) B- core- connected if there 
exists a set of edges Eoc C UtE{t) such that the graph ({1, . . . , n}, Eoc) is connected; 
and 

Eao C u['^]^^j^E{t) 

for every k. 



That is, a sequence is i3-core-connected if there is a set of edges, forming a 
connected graph, each of which appears in every interval [kB+1, {k+l)B]. We will say 
that the edges in £^00 are core edges. We note that this is a stronger assumption that 
the i?-conncctivity assumption typically made in the analysis of consensus algorithms. 
We will discuss this point in more detail shortly. 
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Our main result will show provide bounds until our consensus protocol reduces a 
certain measure of disagreement to a small value forever. We now define this measure 
(this is V2{t)) in the next definition) as well as several related concepts. 

Definition 2.2. 



max Xi 

i— l,...,n 



mm Xi 

i— l,...,n 



M{x) 
m{x) 

W{x) = M{x) - m{x) 
V2{x) 




Note that wih use the natural shorthands A/(t), m(t), W{t),V2{t) for M {x{t)), m{x{t)),W{x{t)),V2 {x{t)). 
We can now state the main result of this paper. 

Theorem 2.3. If0<a</3<1 then for all nodes i, initial values a;(0), and 
B- core- connected sequences G(t), it is true that 

1 " 

lim x.,{t) = - Vxj (O). 
t— >oo n 

i=i 

Moreover, i£] 

t>2T^ (2T^ \4{1 + B)W{0) + 23]!^ + (20S||a:(0)||oo)T^ + 26B + (128n''B)T^^ 
then 

V2{x{t)) < e 

for all 

Our result states that the consensus protocol of the previous section succeeds in 
computing the average on B-core-connected sequence and provides a bound for its 
convergence time. 

Remark 2.4. While the above convergence time expression is somewhat unwieldy, 
we emphasize that it is polynomial in n, e, _B, VF(0), V^(0), | |x(0)||oo for any choice of 
a and j3 satisfying the assumption < a < /? < 1. 

Remark 2.5. Observe that there is a tradeoff between the steady-state and tran- 
sient terms in the convergence time. In particular, as e ^ 0, the term (8n^'^/e)^^" 



\x~\ means the smallest integer which is at least x. 
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is going to dominate all the other terms in the convergence time expression. Conse- 
quently, to obtain the best asymptotic decay rate we should choose a close to 1 which 
means we must also choose (3 close to I. In that case, error decay at time t decays 
nearly as well as 0{n^'^ /t) as t approaches infinity. However, choosing q;„ — > 1 and 
Pn 1 causes the transient term to blow up, so that it takes longer and longer until 
the asymptotically dominant term dominates the other terms. 

In general, there is no single best choice of a, j3; rather every choice gives us 
a tradeoff between transient bounds and steady-state decay. For example, choosing 
a = 3/4, /3 = 7/8 gives us that the time until V2{x{t)) < e is 

o((B + W^(0) + BW^(0))^)+O(S||x(0)|U)^+O(n32s«)+max(^O(n32s«(logM^l)8,O 

Note that every term which does not have an e in it will become negligible as e — > 0. 
In this limit, the dominant term will be 0(^73) which will grow faster as e — >■ than 

the logarithmic term 0{n^^B^ log ) . 

On the other hand, choosing a ~ 1/4, /3 = 1/2 gives us that the time until 
V2{x{t)) < e is 

O ((B + W{0) + BW{0))^^ +0 (B||a;(0)|U)*^/^+O(n^S2)+max (^©(n^B^Qog M^)2^ q 



which has a worse asymptotic term ^ \ 7rj but the rest of the terms are smaller if 
B,W{0),\\x{0)\\oo,V2iO) are large. 

Remark 2.6. The analysis of consensus algorithms usually relies on a weaker 
notion of connectivity, namely the so-called B- connectivity (sometimes called uniform 
connectivity) condition: a sequence G{t) of undirected graphs is called B-connected if 
for each k, the graph 



[{l,...,n},utl'^f,Eit)) 



is connected. This encompasses a wider class of sequences compared to notion of 
B- core- connectivity which we use, which requires not only the above graph to be con- 
nected, but also that it has the same connected subgraph for every k. 

However, it is easy to see that our convergence proof works on B-connected se- 
quences rather than just B-core connected sequences; thus we have that the convergence 
result limf_^oo Xi{t) = i X]J=i ^i(O) holds for all B-connected sequence. We will jus- 
tify this point in a remark following the main proof. However, the convergence-time 
result is only known to hold for B-core- connected sequences, and it is an open problem 
whether one can obtain a similar convergence time on B-connected sequences. 

Remark 2.7. Our protocol requires node i to keep track of the numbers iij.in, iij.out 
for every other node j it interacts with. By contrast, the standard consensus algo- 
rithm keeps track of only a single real number, namely Xi{t) at ndoe i. Unfortunately, 
it seems that a ".storage blowup" phenomenon of this sort is unavoidable, though it 
remains an open question to prove this in a formal sense. 

As a consequence, our consensus protocol is is most attractive on graphs sequences 
G{t) in which every node interacts with a relatively small number of neighbors. One 
such example is the geometric random graph model of wireless networks, in which 
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sensors have locations in [0,1]^ and every node is connected to O(logn) neighbors []. 
Our consensus protocol will work in such networks, even if unpredictable interference 
or maliicous jamming makes links unreliable, with every node needing to store only 
O(logn) additional estimates. 

3. Main proof. This section is devoted to the proof of our main resuh, Theorem 
12.31 We win begin with a long sequence of lemmas which will establish properties of 
our consensus protocol, and put off the calculations that lead to the bound of Theorem 
12.31 until the last possible step. 

Our first lemma states the natural fact that the estimate iEj,i,out maintained by 
node j equals the estimate Jj,j,in maintained by node i. Intuitively, this means that 
node i knows node i's estimate of its value Xj{t). 

Lemma 3.1. 

whenever these variables are defined. 

Remark 3.2. Recall that the variables Xi_j^in{t) and Xj^i^outit) initialized and 
maintained at the first time (i,j) appears in the sequence G{t). 

Proof. The first time G E{t), the variables Xij-,in(^ — 1) and a;j,jout(^ ~ 1) 

are initialized to zero by both nodes i and j. Subsequently, they are updated as (see 
Section!^ 

(f\~9 ■ (f n I 



at each time t such that {i,j) G E{t). Consequently, they have the same value at all 
times. □ 

The next corollary tells us that if i uses Xi,j,in(i) in its Metropolis update of Eq. 
(|2.ip . then j uses %,i,in(i) in its own update at that time as well. Thus our protocol 
in a sense maintains the natural symmetry of the graphs G{t). 

Corollary 3.3. j G S{i,t) if and only ifi g S{j,t). 

Proof. Suppose j G S(i, t). This happens if and only if qi^j (t) — qj^iit) ~ and 

4 

\Xi,j,in{t) - X,j^out{t)\ > —. (3.1) 

The condition qi^j{t) ~ qj^i{t) = is naturally symmetric between i and j. More- 
over, using Lemma |3. 11 we have that Eq. (|3.ip is equivalent to 

_ _ 4 

\Xj,i,out{t) — Xj^i_in(i)| > — , 



SO that i G S{j,t). □ 



We next remark that the update equations of our protocols may be rewritten in 
more convenient matrix form. 

Remark 3.4. For i j, define 

Jo if3iS{i,t) 

Then 

jGS{i,t) 

We will subsequently require some bounds on the sizes of the weights Wij{t — 1) 
from the above equation. These are provided in the following lemma. 

Lemma 3.5. If Wij{t - 1) 7^ then 

^<w;y(t-l)<2 

Proof. Indeed, since Wij{t — 1) ^ we have that j e S{i, t) which means 

4 

\x^,jSn{i) - ^JJ,out(i)| > — (3.3) 

Moreover, we also have that qi^j{t) — 0, which means that 

\Xi,j,out{t) - Xi{t - 1)1 = |5:,;j,out(i - 1) - Xi{t - 1)1 < ^ 

and similarly, qj^i (t) = which means 

\Xi,j.init) - Xj{t - 1)1 = \xi,j.init - 1) - Xj{t - 1)| = |%,,,out(i " 1) - Xj{t - 1)| < ^. 



where the second equality used Lemma 13.1 1 We therefore have 

2 

\XiJM{t) - X^^J,out{t)\ - \Xj{t - 1) - Xi{t - 1)1 < — 

and combining this with Eq. p.3p . 

„,, _ T \ — Xij,in{t) — Xij,out{t) A 4 

which completes the proof. □ 

Remark 3.6. As a consequence of Lemma lS.ll we have that Wijit) = Wji{t) for 
all i, j, t. 

Our next lemma rewrites the update dynamics of our consensus protocol in still 
more convenient form. 
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Lemma 3.7. We may rewrite the main update Eq. 112.1]) as 
x(t) = (1 - ^)x{t - 1) + ^A{t - l)x{t - 1), 

where A{t— 1) is a symmetric, stochastic, diagonally dominant matrix with the prop- 
erty that ifaij{t— 1) is positive, then aij{t — 1) > 1/(8 max((ii(t), (ij(t))). 



Proof. From Eq. (jS^), 

" ' 4max(d,(t),d,(t)) ) + 4max{d,(t),d,(t))"^^* 



We therefore define A(t — 1) in the natural way from the above equation by setting 

^ _ Wy(t-l) 



4max(dj(t),dj(t)) 



for all i ^ j and 

1) 



au = l- ^ 



jes(».t) 4max(d,;(t),dj(i)) 

By construction the rows add up to 1. The off-diagonal entries are clearly nonnegative, 
and the diagonal entries are not just diagonal but at least 1/2 because Lemma [531 
implies that 

Amax{d,{t),d,{t)) - 2 

This makes A{t) nonnegative, stochastic, and diagonally dominant. Finally, Remark 
13.61 implies that A{t — 1) is symmetric. □ 

The next lemma lists some natural Lyapunov functions for our protocol. 

Lemma 3.8. M(t) is nonincreasing, m(t) is nondecreasing, and V2{x{t)) is non- 
increasing. 

Proof. The first two claims follow immediately from the stochasticity of A{t — 1). 
The last one follows from the symmetry of A - see ^11 for ^ proof. □ 

The following lemma esablishes that the sizes of the estimates possessed by the 
nodes cannot be too large. 

Lemma 3.9. For all i,j,t such that 5?^ j iii(t) is defined, 

\x^,J^n{t)\ < ||x(0)||oo. 



10 



Proof. By induction, 2?ij-,in(i) belongs to the convex hull of and Xj{0)^ Xj{l)^ Xj{2), . . . , 
Consequently, its absolute value cannot be larger than niax(| a; j(0)|, |a;j(l)|, . . . , 
By Lemma [3. 8[ this is at most ||x(0)||oo- D 



We now proceed to our first substantial lemma, which proves that after some 
finite number of steps which depends polynomially on the initial condition x{0), the 
estimates Xij^ia(t) associated with core links are close to the correct values 

x,(t-l). 

Lemma 3.10. // 

t > 2^ \8W{0) + 3B] + (20S||a;(0)||oo)^ + 2GB 

then 

\xi,jM{t) - Xj{t - 1)1 < (^^_2B-l)o' 

for every pair of nodes i,j such that the edge is a core edge. 

Proof. First note that as a consequence of Lemma 13.81 and 13.71 we can easily 
bound how much nodes change value from one step to the next. Indeed, Lemma 13.81 
implies that for any t, 

ma^\xkit) - xiit)\ <W{0), 
and therefore from Eq. (j3.2|) . 

We now use this observation to relate the message qj^i{t) to the estimation error 
\xi,j,in{t) — Xj{t — 1)|. Indeed, observe that qj^i{t) = 0, then 

\Xi,j,in{t) - Xj{t - 1)1 = \x^jSn{t " 1) - Xj{t - 1)| < 1. 

On the other hand, if qj^i{t) E { — 1, 1}, then 



At) - xjit - 1)1 < |J.,,i„(i - 1) - Xj{t - 1)1 - 1 

< \x,.j,Ut - 1) - xj{t - 2)1 + \x,{t - 1) - xj{t - 2) 

(l/2)H/(0) 1 



1 



< \x,,j,init-l)-Xj{t~2)\ + 



{t-1}^ t° 



Now if t is large enough so that t > 1 + (8W(0)) , we will then have the upper 
bound 14^(0) < (l/8)(t — l)/^-" -^yj^jQj^ allows us to eliminate W{0) from the above 
equation: we get that if qj^i{t) e { — 1, 1}, 



\x,,j,in{t)-Xjit~l)\ < |£.,,-.in(^-l)-3-j(t-2)|+ . -- < |J.,,,in l)-^;, (t-2) | 



1/16 1 ,^ / ^ / „M 7/8 

(i^-^ < \x.,Mit~iyx,{t^2)\—L- 

(3.4) 
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where we needed that t > 1 for the final inequafity. Thus, for large enough t, the 
estimatfon error \xij.in{t) ^Xj{t~ 1)| either decreases at each step or is already small. 
We now proceed to turn this into a quantitative bound on \xij^i-a{t) — Xj{t — 

Let A be the set of times when the link appears in the graph sequence. 

Suppose z > 1 + (8VF(0)) i^-" and t> z\hy considering the last element in A in [z,i\ 
we obtain the inequality 

|x»,j,i„W-a;,(t-l)| <max 2||a;(0)||- ^ t'=. TFF " ^ fc^ 

\ fceAn[z,t] ' "' feeAn[t'+i,t] 

(3.5) 

Note that to justify this inequality we needed the claim that \xi^j^i-a_{z—\)^Xj{z^2)\ < 
2||a;(0)||oo which is justified by appealing to Lemmas 13.81 and 13.91 

Next we invoke the assumption that every core edge that appears in each block 
of times [kB + 1, (fc + 1)B]. This means that because A has nonzero intersection with 
each [kB + 1, {k + l)B], we can infer that for any [a, b] such that a > 2B and b—a > 2B 
we hav^: 

^ k"" - 2B ^ fc" ■ 

feGAn[a,b] k=a 

We now plug this lower bound into Eq. p.5p . while simultaneously setting z = 
[(8W^(0) + ZB) ] (which is of course always at least 2B, strictly larger than 1, and 
strictly larger than 1 + (8VF(0))^~"). Note that the lower bound on t assumed in the 
statement of this lemma certainly implies t — (z + 1) > 2B; thus we have 



|Si,j,in(t)-a:j(t-l)| < 



( t t \ 

I ^ , , t'^l(3W(0)+3B) f-a l,...,t-2B-l fc=t' + l ^ ' 



(3.6) 

Our next step is to lower bound the sums appearing on the right-hand side of the 
above equation. Using the standard method of lower-bounding the sum of a nonin- 



^This inequality follows by some elementary manipulations, which we spell out in this footnote 
so as to avoid interrupting the flow of the above proof. Indeed, note that every interval of length 
2_B or more has at least one element from A. Moreover, since 1/fe" is a decreasing function of fc, we 
have that 

-> y ^ 

fcSAn[ii,6] fc>i such that a+k2B<b 

Now since a + 2B < 2a (due to the assumption a > 2B), we have that l/(a -I- 2B)" > l/(2a)" so 
that 

E ->(- E E ^— 

f-f , fc« - I 2 (a + 2B)" 2 (a + 2_B)« , 2k°' - , -f^ 2 • 2"k°' 

feeAn[a,b] \ ^ fe>2 such that a+k2B<b / fe>0 such that a+k2B<b 

which implies 

b 



y ±>±ylll 

^ k°: - 2B k" 

keAn[a,b] k — a 
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creasing function by an integral, we have 

^ fc" ~ 1 — a 1 — a 

h—a 

Now the lower bound on i we have assumed in the statement of this lemma implies 
that i > 2T^[8W^(0) + 3B]/J^ and t^^-^^/^ > 20B||x(0)||oo so that Eq. dSj]) implies 
that the first term in the maximization of Eq. p.6p is negative. As for the second 
term in the maximization, it is therefore bounded as 

1 1 ((i + 1)1-" - (f + 1)1-") 

max — 

t' = [8H'(0)+3B]4,...,t-2B-l (t')" lO-B 1 - a 

and since i! > 20 B, the expression inside the maximum is an increasing functiorH of t' . 
Consequently, we may choose t' = t — 2B — 1 in the second term, which immediately 
implies that 

\x^,J^n{t)~XJ{t~l)\< 



{t-2B- ly 



We next demonstrate a consequence of the previous lemma: if t is large enough 
and the message Qi-^j on a core edge is nonzero at some time, then the same message 
qi^j will be zero for a substantial amount of time whenever the edge (?,j) appears 
again. 

Lemma 3.11. Suppose 

t > 2^ [8(1 + B)W{0) +3B]i^ + (20B||x(0)||oo)^ + 20B + 1 

and the edge is a core edge. If qi^j{t) ^ then qi^j{k) = for all k g 

[t + l,t + 3-B] during which {i,j) appears. 



Proof. By Lemma 13.101 

1 

{t-2B- 2)" 

Consequently, qi^j {t) ^ implies that for each fee [t, t + 3i? — 1] 



|S?.,,,in(t - 1) - x,[t - 2)1 < (3.8) 



|x»jan(fc)"a;j(fc)| < i„(fc) - x,(fc - 1)| + ^^/^^^^^^ 



< |x,j-i„(t)-a-,(t-l)| + 3S 



(1/2)1^(0) 



- {t-2B - 2)" ^ 



^Indeed, if f{t) = c/t"' + d{t + 1)1"" where a e (0, 1) then 

f'(t) = -act-"-^ + d{l - a)(t + 1) 

The latter expression is nonnegativc whenever (t+i)<^ — ^ ' 
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Here the first two inequalities follow from the observation that Xj(t) and |2ci,j,in(0 ~ 
Xj{t — 1)1 can increase by at most '^^^'^'j^^^^^ as t increments by one; the third follows 
from Eq. (|3.4p : and the last one follows by plugging in Eq. p.8p . 

We claim the the assumption t > 2 [8(1 + B)W{0) + 35]/?^ + {20BVoo{0))^ + 
20B + 1 implies the final expression on the right is always at most l/{t + 33)". Indeed, 
this assumption implies that t > 20B so that t - 2B - 2 > O.BOt and t + 3B < l.lbt. 
Consequently, 

1 1 , 7/8 ^ 1/1.15°- 1/0.8" + 7/8 



{t + W)" {t-2B- 2)" 

> ^ 
- t° 

_ 3/8 w3-a 

> ^-^BW{Q) 

where the last step used our assumed lower bound on t which implies that > 
4:BW{Q). The last inequality is, after rearranging, precisely our claim that the right- 
hand side of Eq. is at most l/{t + SB)". 
Since 

\x.,Mk) - xm\ < j^^^ < ^ 

for aU k £ [i, t + 35 - 1], we have that q.i^j (fc) = for all fc G [t+l,t + 3B] when the 
edge appears. □ 

A corollary of this lemma is that every 3B steps, both messages qt^j and qj^i 
across a core link will be zero. We state this formally next. 

Corollary 3.12. Suppose the edge is a core edge and suppose 

t > 2T^[4(1 + S)W^(0) +2B] + (20S||x(0)||oo)^ + 20B. 
Then there exists a time in [t + l,t + 3B] with 



Proof. Indeed, since the edge {i,j) appears at least once in every period [kB + 
1, {k + l)i?], it follows it must appear at least thrice within the period + 1, < + 'iB]. 
If both qi^j and qj^i are zero during the first of those times, we are finished. If not. 
Lemma 13.111 implies they will be zero during either the second or the third of these 
times. □ 

Our next lemma forms the core of the proof our main theorem. Informally, it 
states that if the variance is not too low, it must satisfy a quantitative decrease 
bound. Before proceeding, we need to mention the standard relationship between 
W{t) and ¥2(1) which we will use: 



<V2{t)<./T^W{t). 



(3.10) 
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Lemma 3.13. Suppose that 

t> 2r^\4:{l + B)W{0) + 2B]i^ + (20B||x(0)||oo)^ + 20B 

and 

V2{x{t)) > (3.11) 

Then 

Proof. From Eq. p.lip . applying Eq. p.lOp we have, 

W{t)>S^. 

Because the subgraph consisting of core edges is connected, this imphes that there 
exist two nodes i,j with the edge being a core edge such that 

I m ^ W{x{t)) ^ V^ixjt)) 

\x^{t) - xj{t)\ > > g (3.12) 

It is an immediate consequence of the first incquahty in this chain that 

\x,{t)-x,{t)\> ^> ^. (3.13) 

Now our lower bound t > 2\4:{l + B)W{0) + 2B]1^ + (20B||a;(0)||oo)^ + 20B 
allows us to apply Corollary 13 . 121 and claim there exists a time t' in t' E [t+l,t + 3B] 
such that qi^j{t') = qj^iit') — 0. Moreover, since between times t and t' — 1, both 
Xi{t) and Xj{t) can change by at most 3B(l/2)H/(0)/t'^ < (l/2)/i". This means 
\x^{t' - 1) - Xj{t' - 1)1 > 7/r > 7/(t')" and so 

^ _ 7 14 

\x.^3,Ut') - x,,,, out(t')l > ^ - 2(^,_2B-1)" ^ (F)^' 

where we used t > 20B for the final inequality. The above equation implies that 
j € S{i,t') and consequently i G S{j,t'). 

We now proceed to lower bound the decrease from V2{x{t' — 1)) to V2{x{t')) due 
to the link between i and j at time t'. 

Since j G S{i,t') we have that aij{t' — 1) is positive, and therefore ay(i' — 1) > 
l/{8ma.x{di{t'),dj{t'))). Using the decomposition (see [21], [25] ) 

A^it' - 1) = / - - l)]„-(e, - e,)(e, - e,)^, 

i<j 

where as usual means the i'th basis vector, we have that 
V2{A{t' - l)x{t' - 1)) = {x{t' - 1) - x{t' - l)lfiAit' - l)f{xit' - 1) - xl) 

= V2{x{t' ~ 1)) - j2[A^{t' - i)ut)i^.it' - 1) - ^jit' - 1))^ 

i<2 
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and, since A{t'—1) is diagonally dominant and satisfies a,;j(t' — 1) > (1/8) max((i.j(t'), dj{t')), 
we have that [A^it' ~ 1)],^ > {l/{16max{di{t'),dj{t')); moreover, using Eq. 
we obtain 

V2{A{t'-l)x{t'~l)r<V2ix{t'-l)r ^ V{x{t'-l)r 



16max(dj(i'),dj(i')) 
so 



- l)xit' - 1)) < V.ixit' l))Jl-^ < V2{x{t' 1))(1 - -1^) (3.14) 



and therefore 



^2(a:(t')) < ((1 - ^)^2(x(t' - 1)) + ^V^iAit' - l)x(t' - 1))) 



<(1 



32n' 



(^)^2(^(t'-l)) 



and since V2{x{t)) is nonincreasing in t (Lemma 13. 8p and i' < t + 3i3, this concludes 
the proof. □ 



We are almost ready to begin the proof of our main Theorem 12.31 However, we 
first need the following lemma which will allow us to bound some of the expressions 
which will appear shortly. 

Lemma 3.14. The function f{k) — e'^/''»^''+^' ^-^f^ ^ ^ (0,1), d, e > is 
nondecreasing over the range k G [to,+oo) where to is any number which satisfies 

Proof. We have 

e(i/d)(fc+/r (c/d)(fc + fy-^k' - efc--ie(i/'')('=+/)' 



f'{k) = 
so /'(fc) > if 



de 
> — . 



ik + f) 



l-c 



We now proceed to the proof of our main result. Following the numerous lemmas 
which we have already established, the proof now follows by some straightforward 
analysis. 

Proof. [Proof of Theorem 12.3] Let us use the shorthand 

T = 2T^ [4(1 + B)W{0) + 25] + (20SFoo(0))t^ + 20B + {256n'^B)^ 
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and let us suppose t > T. If Eq. (|XTT|) holds at times t,t + 3B,t + 6B, . . . ,t + 3kB, 
then we can apply Lemma 13.131 to get 

V.i.it+3kB)) < ^2(.(t))(l- 3,^,(;^3^), )(1^ 3,^,(/^^^^, ) • • • (l- 3,„4(,|3fcg), ) 

which, using the inequality ln(l — x) < —x, implies 

V2{x{t + 3kB)) ^ _ 1 ^ {l/{3B)){t + SkB)^'/^ - {t + 3B)^"f^ 

V2{x{t)) ~ ~[ 327i4(t + 3jB)f^ - 32(1 - P)n^ 

which in turn implies 

V2ixit + 3kB)) < l/2(2;(i))e-((*+3'^-S)'-'-(*+3S)^-^)/(128(l-/?)n^B) 

Now recall that this bound is derived under the assumption that Eq. p. lip holds at 
times t,t + 3B, . . . ,t + 3kB. By considering the the last time after T when Eq. (|3.1ip 
holds, we get the unconditional upper bound 

V2{x{t)) < ma^(l/2(0)e-"*-=*^)"'-(™)"''''(^^«(^-«"'^', max ^^'^it--^^^'-' -i'^+^B)^-")/i^^sii-p)n^ b) Sn^'^ 
^ ^ - ^ ^ ^ T<k<t^e,B fc" {t — 6B)°' 

(3.15) 

Lets consider how long it takes the first term to fall below e. The inequality 

y^(^Q^^-{{t-3By-^-{T+3B}'-»)/{12Sil-P)n^B) < ^ 

is implied by 

(t - 3B)i-^ - (T + 3B)i-^ > 12871^5 log 



or 



t>'3B+[{T + iBf-f^ + 12871*5 log M^l 1 



which, using the inequality (a + bf < 2"-^ {a" + h") for k > 1, is in turn implied by 
t>3B + 2^ (r + 3B+ (128n*Blog— 



no)' 

Next, we consider the second maximum in our unconditional upper bound of Eq. 
(|3.15p . According to Lemma [3. 141 if T is large enough so that 

> 128r7'*Sa 



(T + 35) 



we may set A: = t — 6B in the maximization problem. But since T > 20B, we have 
that for this to be true it suffices that T > (1.15-128n4Ba)i/(i-^) which clearly holds. 
As a consequence, we finally have V2{x{t)) falls below e after 

6B + max ( ( T + 3B + (128n* B log —^)T^^ 



iterations. This is exactly the bound of Theorem 12.31 after a combination of terms 
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□ 



Remark 3.15. We now substantiate the claim, made earlier in the introduction, 
that the convergence result 



1 " 

lim Xi{t) = — Xj(0) 



n 

holds for all B- connected sequences. Indeed, our proof here proceeds by arguing that, 
after some transient period, the following event occurs every 0{B) steps: there is a 
link {i,j) at time t with both messages qi^j{t) and qj^i{t) equal to zero, such that the 
"gap" \xi{t) — Xj{t)\ is big (i.e., \xi{t) — Xj{t)\ > V2{x(t^3B))). To get this result, we 
needed to assume the existence of a connected set of core edges which occur in every 
[kB+l,{k+l)B]. 

If such a core set of edges does not exist, we still get the same conclusion but 
without an 0{B) bound on how long it takes for it to occur. This is sufficent to yield 
the convergence we claim, but not the convergence time bound in Theorem \2.3\ Indeed, 
let us assume without loss of generality that Xi(t) < X2(t) < ■ ■ ■ < Xn{t) and let us 
consider the largest "gap" \xi^i{t) — Xi(t)\. It is easy to see that there exists an i 
such that — Xi{t)\ > V2{x(t))/n^'^ . What we need to show is that there is a 

later time t' > t such that a node i' < i and a node j' > i + 1 have the property that 
q,,^j,{t')^qy^,,{t')=Q. 

This follows by induction. The claim is trivial for n ~ \. If it never occurs, 
the network effectively becomes disconnected into two strictly smaller networks, each 
achieving consensus by the inductive hypothesis. In that case, its easy to see that qi^j 
and qj^i eventually equal zero for all edges {i,j) which occur infinitely often. 

4. Simulations. Here wc report on some simulations of our consensus protocol. 
We will be focusing on seeing how the performance of our protocol scales with the 
time t for fixed n and seeing how the time to reach a certain level of accuracy scales 
with the number of nodes for various graph topologies. 

We will be simulating a slightly modified form of our protocol in which we omit 
the stepsize of 1/t^ , i.e, in which we set /3 = 0. The introduction of this stepsize is a 
mathematical necessity for our convergence result, but it does not appear practically 
necessary. Similarly, we will double the weight every agent places on its neighbors 
as our consensus protocol is likely too conservative. Thus we will be simulating the 
update 

^ ^.,1^,) 2max(d.(i),d,W) 

instead of Eq. ([2?T|) . 

In Figure |4?T1 we show the decay of the error max^ \xi{t) — ■^Y^^=i^ji^)\ with 
time for the complete graph and the line graph with a = 0.9. Much like our error 
bound predict, we see a slow decay which appears to be asymptotic to a power of 
t. We point out the jagged features of the graph, which are more prominent in the 
case of the line: it often takes some time for the estimates Xij_in to become accurate, 
which results in periods without any updates. 

In the next Figure 14. 2[ we are concerned with the time it takes our consen- 
sus protocol to achieve a certain level of error; we plot the time until max^ \xi{t) — 



18 



Error vs. number of iterations for the compiete graph on 20 nodes 



•>C<W>OOWCOCOOOC<XXXXX>(X>0<>0<X>(XXXXX>OCC 



10 20 30 40 50 60 70 80 

Number of iterations 



Error vs. number of iterations for a line graph on 20 nodes 



XXX)CCOOOOOO<XXX 



10 20 30 40 50 60 70 80 90 100 

Number of iterations 



Fig. 4.1. The left plot shows our consensus protocol on the complete graph on 20 nodes, while 
the right graph shows our consensus protocol on the line graph on 20 nodes. Both graphs show the 
error at time t, i.e., max^ \xi{t) — ^ Ej^i '^^ number of iterations t. Initial conditions 

are random for both graphs, and we have chosen a = 0.9 in both. 



Number of iterations needed to achieve an error tolerance vs. number of nodes for the complete graph 



Number of nodes 



Number of iterations for an error tolerance vs. number of nodes for the line graph 



Number of nodes 



Fig. 4.2. Both graphs show the time it takes for our protocol to reach an error of 0.05, i.e., 
the time until maxj \xi(t) — ^ — 0-05. The number of nodes is shown on the x-azis. 

The left picture is for the complete graph while the right picture is for the lien graph. The initial 
condition is the vector with 3:i(0) = 1 and a?fc(0) = 1 for k = 2, . . . ,n in both cases, and as before, 
a = 0.9. 



nX!j^i^j(0)| ^ 0-05 for the complete graph and the hne graph. As expected, for 
the complete graph, the time does not grow with n. We note how irregular this time 
is for the complete graph. Indeed, the time it takes to attain a certain level of ac- 
curacy often appears discontinuous for our protocol, since our protocol itself is also 
discontinuous. Finally, we see a reasonably slow growth with n for the case of the 
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line. 

Log iterations vs. number of nocjes for time-varying random iine grapti 

Error vs. number of iterations for a time-varying ranrjom line graph on 20 nodes 
0.65 1 1 1 1 1 1 1 1 1 1 
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Fig. 4.3. Both graphs show our protocol on a time-varying line graph which is randomly gener- 
ated at each stage. The left graph shows the decay o/maxi \xi{t) — ^ ^j(0)| with time while the 
right graph shows the logarithm of the time it takes to achieve maxi \xi{t) — ^ ^ 0.05. 
The initial condition is random on the left and equal to the vector xi{0) = 1 and Xf^{0) = 1 for 
k = 2, . . . , n on the right. In both cases, as before, a = 0.9. 

Finally, in Figure l4!3l we show our protocol on a time- varying graphs. We generate 
a random line at every stage by first relabeling the vertices uniformly at random and 
then connecting them in a standard line (i.e., putting an edge between 1 and 2, 2 and 
3, and so forth). We see that the protocol appears to work, but the convergence time 
is considerably slower than for either of our fixed graph examples. 

5. Conclusions. We have provided a consensus protocol in which nodes ex- 
change ternary messages at every step. In contrast to the previous literature, our 
protocol can handle time-varying graph sequences, requires the exchange of only 
finitely many bits at every step, comes with a polynomial-time convergence rate, 
and does not need knowledge of either the sequence G{t) or the number of agents to 
be implemented. 

Our paper naturally raises a number of open questions. First, our protocol uses 
more storage than the standard consensus algorithm due to the need by each node 
i to maintain the estimates ii j,in, ij,j,out- Is it possible to avoid this without using 
"unphysical" operations (for example, interleaving the digits of two real numbers to 
form a single real number)? 

Secondly, our convergence time result was demonstrated to hold on _B-core-connected 
graph sequences. However, most of the previous literature on consensus protocol has 
relied on the weaker assumption of B-connected graph sequences. An open ques- 
tion is to derive a similar convergence time result on the larger class of _B-connected 
sequences. 

Third, while the standard consensus protocol can be said to rely on broadcasts 
in the sense that each neighbor of node j measures or receives the same value Xj{t) 
at time t, our protocol relies on individualized messages from every node to each of 
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its neighbors. Wc wonder whether there exists a consensus protocol with the same 
properties as ours (finitely many bits exchanged at each step, polynomial convergence 
rate on time-varying graphs, does not require knowledge of the graph sequence or of 
the number of which agents) which relies only on broadcasts by each node at every 
step. 

Finally, the relation between speed, storage, and communication overhead in con- 
sensus is still poorly understood. There appear to be some tradeoffs involved between 
these quantities; for example, the current paper which puts aside the assumption that 
nodes can transmit real numbers derives a slower convergence time bound in contrast 
to the best average consensus convergence time [21] and has a larger amount of stor- 
age at each node. However, whether there are indeed any formal tradeoffs between 
these quantities is not clear at present. 
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